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Abstract 

We show that the classical Kac's random walk on S n_1 starting from the point mass at ei mixes in 
0(n 5 log n) steps in total variation distance. This improves a previous bound by Diaconis and Saloff-Coste 
ofO(n 2n ). 

1 Introduction 

Mark Kac proposed the following simplified model of one-dimensional Boltzmann gas dynamics (for 
historical details, see [3], [!]): For n particles on R, we can represent their velocities (vi, . . . ,v„) as a 
point on the unit sphere 5" 1-1 , after normalization so that 

71 

;=i 

. Conservation of kinetic energy (assuming potential energy) in the gas dynamics is equivalent to 
(vi(t), . . . ,v n {t)) staying on S" 1-1 for all t > 0. We will not introduce momentum conservation in our 
model, because that will force the collision to be inelastic (see 2nd paragraph below). 

Each time there is a collision, it occurs with probability 1 between two particles, which corresponds 
to choosing two coordinate directions Xi, Xj and rotating S n ~ 1 along the 2-plane x; A Xj by some angle 6. 
Notice that + before and after the collision stays the same, since the velocities of the other particles 
are not affected by the collision. 

By disregarding the position information of the particles (which have to be confined in some compact 
domain, for example S 1 , else they will eventually run off to infinity), each collision occurs between any 
pair of the particles with equal probability -ppr . The rotation angle 9 can be chosen from some distribution 

on [0, 2-7r), which physically is a measure of the elasticity of the collision; for example, inelastic collision 
in R will correspond to a distribution of 8 that's a delta measure concentrated at -k. In this paper, we 
will assume that 9 is uniformly distributed on [0, 27r). 

Thus we obtain a discrete time Markov chain on 5" 1-1 with transition kernel given by, for / : S n ~ — > R 
continuous, and x £ S n_1 , 

1 n r 27V 1 

where R(i,j;8) denotes the rotation along the oriented i A j plane by the angle 9. By transposing, K 

defines a map from the set of probability measures on S" 1-1 to itself, since K(l) — 1. 

It is not too hard to see that the U n -i, the uniform distribution on S 1 " -1 , is a stationary distribution 

for K: for each summand Kij (without yiy in {l}), we have 

(2) 

U n -i(K i:j f) = f {K^f)(x)U n -4dx) 



Js™- 1 JO 
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S n-1 



f(R(i,j;6)x)-^d8)U n -i(dx) 
^(T f(x)d9)U n ^(R(i,j;9)dx) 

1 27T J () 

f(x)U n - 1 (dx) 
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using a change of variable formula and the fact that U„-i is invariant under rotations. 

This establishes that U n -iKi t j — U n -i for all i 7^ j. Thus their average U n -iK — U n -i as well. 

We further claim that Markov chain is aperiodic because once a point is reached, it can be reached 
in the next step with positive probability density for any rotation. It is also irreducible since along a 
sequence of rotations (ii A i%, . . . ,ik A ik+i) that form a connected spanning graph in K n , the complete 
graph on n vertices, one can transport any point on S" 1-1 to any other point with positive probability 
density; such sequence of rotations certainly occur with positive probability. Thus by general theory of 
Markov chains, we know that with any initial distribution fj, on 5 n_1 , 

lim (J,K l (A) - U n -i{A) = 

l — >oo 

uniformly in A C S. This implies total variation convergence. 

Using the C 2 theory of discrete time Markov chains, it can be shown that if the starting distribution 
fi is m C 2 (S n - 1 ,U n -i), then we get the following convergence bound 

UK 1 -U n - X \\ TV <|| M -l|| £2 (l-i-) i 

by the result in [4], that shows the spectral gap of K is bounded below by In fact it's given exactly 
by , ? +2 n for n > 2. 

J 2n(n— 1) — 

If the initial distribution fi does not have an C 2 density with respect to U n -i, then the C 2 theory 
above fails. The best result for the rate of convergence when the initial distribution is say concentrated 
at one point is given in [3], where to get with e close to U n -i in total variation norm requires 0(n 2n log e) 
steps. The C 2 theory gives a mixing time of 0(2n log e\ \ C 2 ). In the following section, however, we will 
show that by some 'truncation' argument, and triangle inequalities for the total variation distance, one 
can still essentially use the spectral gap to bound the total variation mixing time to 0(n 5 (log n log e) 3 ), 
when the walk starts at ei, a fixed point on S n . It is not clear whether the power on the loge factor 
in the mixing time bound can be reduced to 1. In this sense, the result is not a strict improvement over 
®. 



2 Bounding the Total Variation Distance 

Next we want to bound the convergence rate of the Kac's random walk on S" 1-1 , in total variation 
distance. 

Recall the total variation distance between two probability measures fj, and v on the same probability 
space (S,S) is defined by the following variational quantity: 

1 1/-* — v\\tv = sup — v(A)\ 

ASS 

where S is the er-algebra on S. 

Let Ak be the event that at the kth step of the walk, every pair of coordinates has been used. Then 
we have 

Conditioning on this event, we have the following two claims: 

1. the density g := - of the resulting distribution jj,' k of the conditioned random walk with 
respect to the uniform distribution on S" 1-1 satisfies the following bound 

g(x) < \r^x i \- n (£ i (-]og\x i \) k )C k f[ m! (2) 

i — 1 m — 1 

k k 2 n —n n k 

<C k |mina;i| n (— log | mina;i|) (3) 

i — 1 i — 1 

for some fixed absolute constant C. 

2. For k > —n 2 log n loge, and e < ^—^ — the set H t := {x £ 5 1 " -1 : \x t \ < e for some i} satisfies the 
following bound on its probability 
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(4) 



Let us first show how claims 1 and 2 lead to a polynomial time convergence rate for the Kac walk 
under total variation norm. Let fi k be the distribution on S n ~ after k steps of the random walk, and 
let fjf k be fi k conditional on A k , i.e., for B C 5 n_1 , 



ti' k {B) =P{5 ei R k G B\A k ) 

where R is the one step transition kernel of the Kac random walk. 
Then we have 



W -P*s||tv <Vk < | 2 j(l- -thj) (5) 



To check this, let BcS" be Lebesgue measurable. Then we have 



fj, k (B) = P(5 ei J? fc G B|A fc )P(A fc ) +P(5 ei R k G B|A|)P(A£ 



This implies 



^ fc (B) - n' k (B) < rj k 

On the other hand, since 

, P({5 ei R h GB}nA k ) 
= PjA^) 

we also get 

— > n k {B) 

1 — J?fc 

which gives 



(J,k{B) > n' k (B) - i] k Li' k (B) 

hence 



Hk(B) - Hk{B) < f)k 

which shows {5J. 

Next recall that a Markov kernel is weakly contracting in total variation norm because if / is a 
bounded continuous function on the state space with 

1/11°° < 1 

then Rf(x) — J R(x,dy)f(y) satisfies the same jC°° bound, hence 

(pR - vR)(f) = (p - v)(Rf) < ||m-i/||tv 

Thus by the triangle inequality we just need to bound ||/4,-R ! — U„-i\\tv from now on, where U n -i 
denotes the uniform distribution on S n ~ l and at the end add r) k to the resulting bound. 

Next we modify [i' k to a different distribution v k as following. We define v k in terms of its density 
with respect to U n -\. 
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On the set 

dv k dfj,' k 

dUn-l dUn-l 

On the set H e , we let its density be a constant equal to the mass of H c under n' k divided by its mass 
under U„-i, which is what's needed for v k to be a probability distribution on S"" 1 ; we invoke claim 1 
above to get an upper bound on this constant: 

dv k _ n' k (H e ) 



dUn-l U n -i(B.) 

£ l/4 

< 



r(S) 



< - 



£ l/4 



/ to— 2 
V 2ir 



2tt 

< e 



Here we used the fact that 



r(f) , 
r(2-i) ' 

which follows from log convexity of the V function. 

The total variation distance between fi' k and v k is given simply by their total variation distance over 
the region H e , hence we have 

/ / nr(-) 

\Wk - ^felll-V < Mfc(He) + r (l) r (n-l) e ( 6 ) 
^ 3 / 2 i i/ 4 

< n e + e (7) 

Thus by choosing e sufficiently small, whose exact value we will determine in the end, we can make 
sure that fi' k and v k are very close in total variation distance. And again by weak contractivity of Markov 
kernel, we now simply need to focus on bounding \\v k R l — U„-i\\tv- Since v k has an C 2 density with 
respect to U n -i, we can use the spectral gap to bound the rate of convergence. First we bound the 
£ 2 (dU„-i) distance between v k and U n —i: 

ll^-f/n-l|| £2(d£ /„_ 1 ) = (/ l;^-l| 2 ^n-l+ / |-^-l| 2 dt/„-i)^ (8) 

Let's bound the two integrals separately. 

For the first integral on the right hand side of (H), we have 

f \~^-l\ 2 dU n -,< [ +U n -!(Ht) 
J He dUn-l J H e dUn-l 

-3/2 8?T F(|) 

6 n-2 e F(^)r(I) 



For the second integral, notice first that is the set of points on S n 1 for which all the coordinates 



are greater than e. So claim 2 tells us that the density , JJ Uk over this region is bounded above by e 



from which we immediately get the following bound 



I H c dUn-l 

Combining (|9]) and (|10[) . we get, for e < i and n > 2 say, that 

\\vk - t / n-i|| £ 2 {d , 7ii _ l) < 2e" n 



' / "'" - l| 2 df/„-i< <T 2n + 1 (10) 
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By the results in [4], we know that the spectral gap of the Kac kernel is — , so we get 

||^i?'-[/„-l||TV<|| 1 ^-l|| £2(dU „_ 1 )( 1 - 1 ) m 

aUn-i n 

<2e~ n {l--) m (11) 
n 

Finally combining ([5]) (|6]) and (|11[) . we get 

\\S ei R k+l - f/n-rllxv < Q (1 - -^y) fc + n i/2 e + 6 1/4 + C k k k " \e\~ " (- loge) fe (l - 1)' (12) 



So it remains to minimize the right hand side of (I12|l with respect to k and I. 

Suppose our target total variation distance is 35. Then we can simply divide 35 into three equal parts 
and bound each summand in (I12|l by 5. We look at each summand below: 
bounding the first summand yields 



>k > (— log 5 + 2 log n) ' " 



So it suffices to take 



fc>n 2 lognlogi (13) 



Bounding the second summand e 1 ' 4 + n 3 ^ 2 e < 5, it suffices to have 

e 1 / 4 < 5/2 
n 3/2 e < 5/2 

which gives 



1 j-4 -3/2 

This will be used to bound / from above in the third summand: 



C k k k2 \e\- n (-\oge) k {l--) 1 <5 

implies we need I greater than 

n(— log 5 + k log C + k 2 log fc — nlog e + k log log(e -1 )) 
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< n(— log 5 + fc log C + n 4 (log n) 2 (log 5) 2 (2 log n + log log n + log log(<5 1 )) + n(— 4 log 5 + — log n) + k log log e 1 ) 

< C'n (log ti) 3 (log 5) 3 
for some constant C'. 

Clearly Z dominates k, so it requires a total of C'n 5 (log n) 2 (log 5) 2 steps to bring the running distri- 
bution of the Kac random walk to be 35 close to its stationary distribution on the unit sphere S"^ 1 . 
Lastly we prove the two claims introduced in the beginning. 
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3 Proof of Claim 1 

Starting at the delta mass at ei, an admissible sequence of rotations in will distribute it over the 
entire S n ~ with positive probability everywhere provided that P(A k ) > 0, i.e., for k sufficiently large. 
This will certainly be the case if k > — n 2 log n log 8 for — log 8 > 2. 

Observe that at step j — 1, j < k, the support of the running distribution is a subsphere of S n ~ 1 . 
Without loss of generality, we call this subsphere S m . Denote by Uj,Vj the axes that span the plane of 
rotation "/j. 

The way jj affects the previous running distribution can be classified into three cases: 

1. Uj,Vj S m , in which case the running distribution remains unchanged. 

2. Uj,Vj £ S m , in which case the support at step j is still on S m . 

3. Uj £ S m , Vj S m , in which case the support of the running distribution grows to be a sphere with 
1 dimension higher than 5"™, denoted without loss of generality S m+1 , and the density with respect to 
the uniform distribution on S m+1 is bounded by (it 2 + v 2 )^ 1 ^ 2 times the previous density bound with 
respect to S m . 

Case 1 is clear because the rotation does not take S m outside itself and for 9 £ [0, 27r], the density at 
(xi, . . . , x m +i, . . . , (uj + v 2 ) 1 ^ 2 cos 6, . . . , (u 2 + v 2 ) 1 / 2 sin9, . . . , x„) with respect to U m only depends on 
the first m + 1 coordinates, which means that averaging over 9 uniformly in [0, 27r] remains the same. 

to understand case 3, we have the following 

Lemma 3.1. Assuming the running density h(xi, . . . , x m +i) with respect to U m after step j — 1 is bounded 
by g(xi, . . . x m +i), and that without loss of generality Uj — x m +i, Vj = x m+ 2, then the new density with 
respect to U m +i after step j is bounded by 

1 ( i \l/2w 2 2 

— g{Xl, (X m+ 1 + Xm+2) ){Xm+l + Xm+2) 

Proof. Denote the new density with respect to U m +i by h(xi, . . . , x m+ 2) with a slight abuse of notation. 
Then we have 

+ Xm+2) 111 cos 9, (x 2 m+1 + x 2 m +2 ) 1/2 sin 9) 

is independent of 9 and in particular equals 

h(xi, . . . , (x m+1 + X 2 m + 2) 1/2 ,Q) 

Furthermore the total contribution of density from (xi, . . . , (x'm+i+Xm+2) 1 '' 2 cos 9, (Xm^+Xm^) 1 ^ 2 sin 9) 
for all 9 should add up to the previous density at the point (xi, . . . , (x m +i + x 2 m+2) X ^ 2 )- In other words, 

/27T 
7 / / 2 1 2 \l/2 Q 1 2 1 2 \l/2 • \ 

h(Xi, . . . ,(Xm+l + Xm+2) COS 9, {X m+1 + X m +2) ' Sin 9) 
- =0 

Mr 2 . 2 \l/2\ 

X\, \Xm + \ + X m +2) ) 

Notice that the factor (x m +i + Xm+2) 1 ^ 2 accounts for the measure of the circle over which we aggregate. 
Thus we get 

H{X\, . . . , {Xm + 1 "t" X m +2 

) 1/2 cos 9, (x 2 m+ x + Xm +2 ) 1/2 sin9) 

— ~^( X m+l + Xm+2) ^ fl(xi, . . . , (X m+ i + Xm+2) ^ ) 

^-1/2 2 \-l/2 / , 2 2 \l/2\ 

<-^{x m +i + Xm+2) g(Xl,...,{Xm+l + X m+2 ) ) 

□ 

to study case 2, we make the assumption that after step j — 1, the density with respect to U m is 
bounded by an expression of the form 

C(ai + bjy 1/2 . . . (al^ + b 2 m ^)- 1/2 [{- log Ixil)'- 1 + ... + (- log Ixm+iiy- 1 } (14) 

where C is a constant that varies with j and m. Here Oi / &; for each i and (01, &i), . . . , (a m -i, b m -i) are 
pairs in (xi, . . . ,x m +i) 2 with the property that no two pairs are the same and each coordinate appears 
at most twice. 
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Lemma 3.2. Under the assumption above, if jth rotation is in case 2, then the new density bound takes 
the form 

2C(j + l)!(a? + bjy 1/2 . . . (a^_ a + bl^y^K- log \ Xl \Y + ... + (- log \x m+1 \y\ 

with possibly a different sequence of (ai,bi) satisfying the same property as before. 

Proof. Without loss of generality assume (uj,Vj) — (1,2). 

The new density h' is obtained from the old density h by averaging over 8 € [0, 2n] of h(R(l, 2, 6)x), 
where 72(1, 2, 8)x denotes the rotation of the vector x £ S m by angle 8 along x\ A12. In formula we have, 

h'(x) = —[ h(R(l,2,6)x)d6 (15) 
2tt Jo 

We write the bound (|14|l as a sum of m + 1 terms and consider one of the terms 

gi(x) = C(a\ + b 2 )~ 1/2 . . . (a„_! + 6^_i)" 1/2 (- log \x 1 \) 3 ~ 1 

By assumption, at most two of cti, bi, ■ ■ . , a m -i, b m -i equals xi at most two equals x 2 . 
By the circle averaging formula (I15p , we have 

1 f 2n 

9i( x ) = 7T~ / g{(xl + xl) 1/2 cos9,(xl + xl) 1/2 sin8,x 3 ,. . . ,x m+1 )d9 
2ti- J 

We shall break the integral into two parts, where the range of integration is over I c = [0, 7r/4] U 
[37r/4, 57r/4] U [77T-/4, 2n] and its complement I s in [0, 2n] respectively, i.e., the ranges are where cos# is 
close to 1 and sin 8 is close to 1 respectively. 

For 8 £ I c , all the factors in gi(x) of the form (x 2 + x 2 )~ 1/l2 , that involves X2 but not x\ upon the 
rotation R(l, 2, 8) becomes ((x 2 + x 2 ) sin 2 8 + x 2 ) -1 ^ 2 , which can be bounded above by V2(x 2 + x 2 + 
x 2 )-^ 2 . 

As of the factors that involve both xi and X2, i.e., (x 2 + x 2 ) 1 ^ 2 , there can be at most one of such. 
And it remains the same under the rotation R(l, 2, 8) since (x 2 + x 2 ) cos 2 8 + (x\ + a; 2 .) sin 2 8 — x\ + x 2 . 

The factors that involve x\ and xi, I 7^ 2, becomes ((xi + x\) cos 2 8 + x 2 )- 1/2 , which we can bound 
as follows: 

using the fact that ±(jaj + |6|) < (a 2 + b 2 ) 1/2 < |a| + \b\, we get 

((x\ + x 2 ) cos 2 8 + x 2 )~ 1/2 ~ (|xi| + \x 2 \)\ cos8\ + |^ s |) _1 

where a ~ b means b/C < a < bC for some constant C. Here the constant can be taken to be 2. 
Finally it's also possible that i £ {1,2}, in which case we also have to deal with a (— log[(if + 

^,2^-1/2 cos 0^y~ 1 f ac tor that goes to infinite for 8 £ la- 
in fact when i = 1, the only factors that have singularities for 8 £ I c and for the coordinates bounded 

away from take the following form: 

((H + |xa|)| cos0| + |x s |)- 1 ((|x 1 | + |x a |)| cos0| + M)"^- log[(a; 2 + x 2 2 )~ 1/2 cos8]) j 
where s 7^ t, or without the xt factor. In the former case we show that the following integral 

f (Qsil + |a*|)|cos0| + \x s \y l ((\xi\ + M)| cos0| + \x t \y\- log[(x 2 + x 2 2 ) 1/2 cos 9]y~ l d9 

271" Jg eIc 

is bounded by 

j\(x\ + xly 1/2 (x 2 8 + x 2 t y 1/2 [{- log \ xi \y + (- log \ X2 \y + {- i og \x s \y + (- log \ Xt \y] (ie) 

whereas in the case a; t is not present, the same bound applies to the expression 
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^- [ ((|T 1 | + |x 2 |)|cos^| + | Ss |)- 1 (-log[( a; ?+ a; i) 1/2 cose]) J - 1 ^ (17) 

clearly because 

((|a;i| + H)cos6»+ > 1 

When i 7^ 1, the logarithmic singularity will not arise when integrating over 8 € I c , so it will trail off 
as a remaining factor of the form (— log |xi|) J_1 < 1 + (— log la^l)- 7 . 
Recall also that we have factors of the form 

2{x\ +x 2 2 + x 2 s )- 1/2 (xl + x 2 2 + x 2 )~ 1/2 (18) 

coming from the uniform bound on the factors involving X2 but not x 2 ; here s,t are possibly different 
indices than those appearing in the singular factors. (|18[) can be trivially bounded above by 2(x 2 + 
x 2 s)~ x ^ 2 {x\ + x\ ) -1 ^ 2 . The remaining inverse factors in g(R(l, 2, 8)x) do not contain x\ or xi, so one can 
easily check that the inductive hypothesis is satisfied. 

The best way to visualize this branching inductive argument is to consider a simple graph onm + 1 
vertices with degrees bounded above by 2. The edges between i and j represents a factor of the form 
[x 2 + x 2 )~ 1 ^ 2 in the bound on the density. A rotation in the xi A X2 plane has the effect of producing two 
graphs onm+1 vertices. Without loss of generality let's describe one of those two descendant graphs, 
the one associated with xv 

there will be edges (1,2), (3,4), (1,3), and (2,4) if X3 and £4 were incident to x\ in the previous 
graph, or simply (1, 2) when x\ only has degree 1. If x\ had degree 0, then it remains isolated in the x\ 
component of the descendant graph. In the process of this rewiring, some logarithmic factors are also 
introduced, namely, if (— log |a^3 1 ) ,? 1 or (— log |s4|) J_1 was a factor in the bound for the previous step 
running disribution, then the new bound will have log | a;^ I ) J - If it's a log factor of other coordinates, 
then the exponent remains the same. 

It remains to prove the bound (|16[1 . which is given by the following technical lemma. □ 

Lemma 3.3. For < x t , x s , < xi, X2, 

rl 

((xt + x 2 )9 + x s ) _1 ((a;i + x 2 )8 + a:t) _1 (- Iog[(a;i + x a )0]) i_1 d0 

<4(j + iy.(x s + x t y 1 {x 1 + x 2 y 1 [(-iogx 1 y + (-\ogx 2 y + (-log^) 3 + (-iogx t y] 

Proof. Without loss of generality, we can assume xt < x s . 

First of all the factor ((xi + x 2 )0 + a; s ) _1 can be bounded above by 2(x s + £ t ) -1 for 8 € [0, 1]. So it 
remains to bound the integral of the remaining factors 

/ ((xi + x 2 )8 + xt)- 1 ^ ]og[(xi + x 2 )8]y- 1 d6 
Jo 

<x^ (- log[(a;i + x 2 )8]y- x d8 + (- ]og[(xi + x 2 )e\y^ {{xi + x 2 )8 + x^dd 

<x^ j\e(- log[(xi + x 2 )e\y + (-log[(xi + x 2 )e]y 1 (x 1 + x^' 1 log[(xi + x 2 + Xt)/((xi + x 2 )e + xi)] 
<a; t ~ 1 j !e (- log[(^i + x 2 )e]y + (- log[(xi + xa)e]) s ~ 1 (xi + x 2 ) _1 log[(xi + x 2 )e] 
=Xf 1 j\e(- log[(xi + x 2 )e]y + (- log[(xi + x 2 )e]) 3 ' (xi + x 2 )~* 

Taking e = xt, we obtain the result. □ 



4 Proof of Claim 2 

Here we use the result from [5] that after k = n 2 log n = logs steps the C 2 transportation distance 
between the running distribution of the Kac random walk on S" 1-1 and the uniform distribution U n -i 
is less than e. So by Holder's inequality, the C 1 transportation distance is also less than e. Now suppose 
^k(He) > e 1//4 . We know that the uniform measure U„-i(H e ) is of the order e; in fact using the marginal 
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density formula for a single coordinate on the unit sphere, one sees that it is bounded above by n 3 ^ 2 e, 
and similarly C/„_i(ff2e) < 2n 3//2 e. So for e sufficiently smaller than n~ s / 2 , we can make sure that 

/ 3/2 -.1/4 r, 3/2 . 1/2 

(n ' e) ' — 2en ' > e ' 

then in order to transport the excessive mass in // e under the fcth running distribution of the random 
walk to other places on any transport strategy has to take all the mass outside Hit, which means 

a distance more than e has to be traversed for each particle mass. This shows the C 1 transportation 
cost for smoothing out the region H e is at least ee 1,/2 , which is greater than the total C 1 transportation 
distance, a contradiction. 

5 Acknowledgement 

I would like to thank my advisor Persi Diaconis for introducing me to the wonderful problem of Kac's 
random walk and directing me to the right papers to look at. 

References 

[1] George E. Andrews, Richard Askey, Ranjan Roy. Special Functions. 

[2] Logarithmic derivatives of the Gamma function, scipp.ucsc.edu/ haber/phll6A/psifun.ps 
[3] Persi Diaconis, Laurent Saloff-Coste. Bounds for Kac's Master Equation. 

[4] E. A. Carlen, M. C. Carvalho, M. Loss, Determination of the Spectral Gap for Kac's Master Equation 
and Related Stochastic Evolutions. 

[5] Roberto Imbuzeiro Oliveira. On the convergence to equilibrium of Kac 's random walk on matrices. 



9 



